Self-taught Learning for Classification of Mass Spectrometry Data: A Case Study of Colorectal Cancer

نویسنده

  • Theodore Alexandrov
چکیده

Mass spectrometry is an important technique for chemical profiling and is a major tool in proteomics, a discipline interested in large-scale studies of proteins expressed by an organism. In this paper we propose using a sparse coding algorithm for classification of mass spectrometry serum protein profiles of colorectal cancer patients and healthy individuals following the so-called self-taught learning approach. Being applied to the dataset of 112 spectra of length 4731 bins, the sparse coding algorithm represents each of them by means of less then ten prototype spectra. The classification of spectra is done as in our previous study on the same dataset [ADM09], using Support Vector Machines evaluated by means of the double cross-validation. However, the classifiers take as input not discrete wavelet coefficients but the sparse coding coefficients. Comparing the classification results with reference results, we show that providing the same total recognition rate, the sparse coding-based procedure leads to higher generalization performance. Moreover, we propose using the sparse coding coefficients for clustering of mass spectra and demonstrate that this approach allows one to highlight differences between the cancer spectra.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of health belief model to identify predictors of colorectal cancer screening intention

Introduction: Belief in the usefulness and effectiveness of screening along with demographic characteristics are among the reasons for doing screening. The main purpose of this study was to identify the predictors of intention for doing colorectal cancer screening using health belief model and demographic characteristics. Materials and Methods: The present study is an analytical cross-sectional...

متن کامل

Self-perceived Mental Health Status and Uptake of Fecal Occult Blood Test for Colorectal Cancer Screening in Canada: A Cross-Sectional Study

Background While colorectal cancer (CRC) is one of the most preventable causes of cancer mortality, it is one of the leading causes of cancer death in Canada where CRC screening uptake is suboptimal. Given the increased rate of mortality and morbidity among mental health patients, their condition could be a potential barrier to CRC screening due to greater difficulties in adhering to behaviours...

متن کامل

Investigation of the effect of the family-centered empowerment model on the self-care ability of patients with colorectal cancer

Background and Aim: Colorectal cancer is a chronic disease that reduces the patients’ quality of life. Therefore, it is essential to have self-care ability. This study aimed to investigate the effect of the family-centered empowerment model on the self-care ability of patients with colorectal cancer. Materials and Methods: This randomized controlled study consisted of all patients who referred...

متن کامل

Metabolites of tobacco smoking and colorectal cancer risk.

Colorectal cancer is not strictly considered a tobacco-related malignancy, but modest associations have emerged from large meta-analyses. Most studies, however, use self-reported data, which are subject to misclassification. Biomarkers of tobacco exposure may reduce misclassification and provide insight into metabolic variability that potentially influences carcinogenesis. Our aim was to identi...

متن کامل

بررسی بقای بیماران مبتلا به سرطان کولورکتال با استفاده از مدل ریسک رقابتی پارامتری

Background and Objective: Colorectal cancer is the most common cancer of digestive system in Iran. The incidence of this cancer has increased in recent years.The aim of this study was to evaluate the survival rate and to define the prognostic factors in Iranian colorectal cancer patients using competing risk model. Materials and Methods: Data were recorded from 1060 patients with colorec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009